The Annals of Statistics 

2006, Vol. 34, No. 2, 994-1012 

DOI: 10.1214/009053606000000191 

© Institute of Mathematical Statistics, 2006 
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Technology 

This paper addresses the problem of fitting a known distribution 
to the innovation distribution in a class of stationary and ergodic 
time series models. The asymptotic null distribution of the usual 
Kolmogorov-Smirnov test based on the residuals generally depends 
on the underlying model parameters and the error distribution. To 
overcome the dependence on the underlying model parameters, we 
propose that tests be based on a vector of certain weighted resid- 
ual empirical processes. Under the null hypothesis and under mini- 
mal moment conditions, this vector of processes is shown to converge 
weakly to a vector of independent copies of a Gaussian process whose 
covariance function depends only on the fitted distribution and not 
on the model. Under certain local alternatives, the proposed test is 
shown to have nontrivial asymptotic power. The Monte Carlo criti- 
cal values of this test are tabulated when fitting standard normal and 
double exponential distributions. The results obtained are shown to 
be applicable to GARCH and ARMA-GARCH models, the often 
used models in econometrics and finance. A simulation study shows 
that the test has satisfactory size and power for finite samples at 
these models. The paper also contains an asymptotic uniform expan- 
sion result for a general weighted residual empirical process useful in 
heteroscedastic models under minimal moment conditions, a result 
of independent interest. 



1. Introduction. Let {yi : z G Z := 0, ±1, ±2, . . .} be a strictly stationary 
and ergodic real time series. Often the finite-dimensional distributions of 
such series are characterized by the stationary distribution and the condi- 
tional distribution of yi, given the past. One problem of interest is to fit this 
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conditional distribution. In general, this is a difficult problem. However, in 
some special time series models where this conditional distribution is de- 
termined by the innovation distribution, it is possible to obtain reasonable 
answers. In particular, in this paper we shall focus on the generalized au- 
toregressive conditionally heteroscedastic (GARCH) and ARMA-GARCH 
models. 

To describe these models, let q,r be known positive integers and Oi(02) 
be a subset of the g(r)-dimensional Euclidean space M'^(R'^), and let G = 
01 X ©2. In the models of interest one observes the process yi such that, for 
some sequences of past measurable functions /ij from ©i to M and hi from 
to M+ := (0,oo), and for some e' = {e[,e'^), Oi G ©i, 62 G ©2, 

(1.1) m:= ''~r^\ ^eZ, 

are independently and identically distributed (i.i.d.) standardized r.v.'s. 
Here, "past measurable" means that, for every s := (si,S2) G ©,si G ©1, 
the functions ^j(si) and /ij(s) should be J^i-i measurable, where J^i is the 
(T-field generated by {r^j, 7yj_i, . . . ,yo, y-i, • • •}, i G Z. Let F denote the com- 
mon distribution function (d.f.) of the errors {r/j}, and Fq be a known d.f. 
The problem of fitting the conditional distribution of yi, given J-i-i, in the 
model (1.1) is now equivalent to testing the goodness-of-fit hypothesis 

Hq:F = Fq vs. Hi-.Fj^Fq. 

The knowledge of the error distribution is important in statistics, in par- 
ticular, in value at risk (VaR). In economics and finance, VaR is a single 
number measuring the risk of a financial position. For example, when yi is a 
process of daily returns, the VaR for a one-day horizon of a portfolio is the 
95th conditional quantile of the distribution of yi+i, given the information 
available at time i. After estimating the parameter 9, the VaR for a one- 
day position of yi and probability 0.05 is /Uj(0i) — 1.6449-\//ij(0), provided 
the r/j's have the standard normal distribution and the estimated parame- 
ter 6 is correct. This means that, with probability 0.95, the potential loss 
of holding that position the next day is yii{9i) — 1.6449 x yjhi{6). Clearly, 
the knowledge of the error distribution plays a crucial role in determining 
this probability, and, hence, in evaluating VaR via model (1.1). Thus, it is 
important to test the hypothesis Hq in practice. For more on VaR, see, for 
example, [29]. 

The goodness-of-fit testing problem under the i.i.d. setup has a long his- 
tory; see, for example, a collection of papers in [7], and references therein. 
A commonly used test is based on the Kolmogorov-Smirnov statistic. The 
primary reason for this is that this test is distribution free, that is, the null 
distribution of this statistic does not depend on Fq. However, when the i.i.d. 
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sequence such as r]i in the model (1.1) is not observed and has to be esti- 
mated from a special model, this property of being distribution free is lost 
even asymptotically. These kinds of problems have been extensively investi- 
gated in the literature in various models, including regression models; see, 
for example, Durbin [9], Loynes [28] and Koul [14], among others. 

In the context of time series models, Boldin [2] observed that for the 
zero mean linear autoregressive (AR) models the tests based on the residual 
empirical process are asymptotically distribution free (ADF) for fitting an 
error d.f. as long as it has zero mean, finite variance and bounded second 
derivative. The condition of bounded second derivative was later relaxed 
in [13] to requiring having only a uniformly continuous density, while the 
condition of having zero mean is crucial for the validity of the ADF property. 
Boldin [3] and Koul [14] observed that similar facts hold for moving-average 
(MA) models. However, this is no longer true of many nonlinear time series 
models such as threshold AR models as noted in [15], nonstationary AR 
models in [21], ARCH/GARCH models in [4, 5, 11], [16], Chapter 8 and [1]. 
For ARCH/GARCH models, Horvath, Kokoszka and Teyssiere [12] proposed 
a parametric boostrap method for testing Gaussianity of the errors, but 
only studied its validity via Monte Carlo experiments. Some of the results 
of Kulperger and Yu [18] can be used to test for the error moments but not 
for the error d.f. 

The main difficulty is that the asymptotic null distribution of the empir- 
ical process of the residuals in the heteroscedastic time series model (1.1) 
depends not only on the distribution Fq, but also on the model functions 
Hi and hi. Under certain regularity conditions, the dependence on fii can 
be eliminated by using a vector of certain weighted residual empirical pro- 
cesses, as is shown here. Roughly speaking, the weights in these processes 
are asymptotically orthogonal to the space generated by the slopes of the 
functions /ij. Moreover, under Hq this vector of processes is shown to con- 
verge weakly to a vector of independent copies of a Gaussian process. The 
covariance function of this Gaussian process depends only on Fq and is the 
same as that of the one arising when fitting a d.f. up to an unknown scale 
parameter. The Monte Carlo critical values when Fq is either the standard 
normal or double exponential d.f. are tabulated. Under certain local alter- 
natives, the proposed test is shown to have nontrivial asymptotic power. 

Section 2 describes the test statistic and the main results. Section 3 
shows that our assumptions are naturally satisfied by a class of GARCH 
and ARMA-GARCH models. Some simulation results and an application of 
the proposed test to the Hang Seng Index in the Hong Kong stock market 
are also given in Section 3. Section 4 contains some proofs. Theorem 1.1 
of [17] has been found very useful in many nonlinear homoscedastic time 
series models and has been extended for nonstationary time series in [27]. 
Theorem 4.1 below gives its extension to heteroscedastic time series models 
and is used to prove the main results of Section 2. 
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2. Test statistics and the main results. This section describes the pro- 
posed test statistics and their asymptotic behavior. 

Let p be a positive integer and Yq denote either the vector (yo, U-i, • • • , Ui-p)' 
or the vector [i/q, . . .)'. In either case Yq is chosen to be independent of 
r]i,i > 1. Let {yi, . . . be observations obeying the model (l-l)- We shall 
assume the following: 

(2.1)The functions fj.i,hi are twice continuously differentiable, for all i. 

Let £i{si) =yi- fii{si), r]i{s) := ei{si)/^yhi{s), 1 < i < n. For any differ- 
entiable function g on Q, let g denote its differential. Thus, for example, 
fj^iisi) = diii{si)/dsi, si £ 01. Moreover, for an s G 0, we write 

hki{s):=^^, k = l,2, l<i<n. 

dsk 

Thus, fii{s) and /ii(si) are (g + r) x 1 and qxl vectors, respectively. Simi- 
larly, hii{s) and h2i{s) are g x 1 and r x 1 vectors, respectively. We denote 
the true parameter by and rn = r]i{6). 

We need the following additional assumptions: 

Fq has an absolutely continuous density /q with sup|a;|/o(x) < oo and 
(2.2) 

has positive and finite Fisher information for location and scale, that is, 
0< jUo/k?dFo<'x, 0< l(^l + ^M^ydFoix)<oo. 

(2.3) 

where ipo ■= /o//o and ipo{x) := (1 + 3;V'o(a^))/2, x G M. 

For example, under (2.2), (2.3) holds whenever Fq is symmetric around 
zero. Let 



[ hM/h.{s)) [ W22AS) 

I{e) := EeWi{e)BWi{9y . 



say, 



Under Hq, B is known and I (9) is the Fisher information matrix appropri- 
ate in the present setup. We shall assume that, under Hq, I{0) is positive 
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(2.4) 



n 



i=l 



We are now ready to introduce the needed weighted empirical processes, 
1 " 

Kn{x, s):=-^Y. W22 As)[I{Vi{s) <x)- Fo(x)], x G M, s G 9. 

Note that this is an r x 1 vector of processes. The tests of Hq will be based 
on Kn{x,9n)- For example, the Kolmogorov-Smirnov test statistic is 

_ 1 " 

Kn = Snp[Kn{x,9nyi-^Kn(,X,9n)], T„ := — V T^22,i(0n)T^22,i(en)'. 

4n 



i=l 



The following theorem is useful in obtaining the limiting distribution of the 
tests based on the process Kn{-,9n)- In it the matrix of the second derivative 
of hi is denoted by hi and the matrix norm is the Euclidean norm. 

Theorem 2.1. Suppose the model (1.1) and (2.1)-(2.4) hold. In addi- 
tion, suppose the following holds. For an open neighborhood Uq of 9, 



(2.5) Ee sup < 00, 

(2.6) Eq sup , , , < cx). 



seUg hi{s)hi{9) 
Ee sup , < 00. 

s€Ug hf{s) 



Then, for every < 6 < 00, 
Kn{x,9 + n~^/^t) 

I 

(2.7) =i^n(x,0) + -n"i^t^22,*(^) 



1=1 



=/o(x) +x/o(x) 



2 K{9) 



where Up{\) is a sequence of stochastic processes tending to uniformly in 
X € K and ||t|| < 5, in probability. 



A proof of this theorem is sketched in Section 4. Here we shall now illus- 
trate its usefulness in obtaining the limiting null distribution of Kn- 

Now, consider the coefficient of t/2 in the second term on the right-hand 
side of (2.7). By the ergodic theorem, uniformly in x G M, it converges to 



(2.^ 



EeW22,i{9) 



^^/o(x)+x/o(x)-^ 
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To reduce the effect of the location related parameters, namely, of 6i and 
fii{9i), on this expression, it suffices to assume 

(2.9) Ee{[Wu,i{e),Wi2,imW22Aey} = 0. 

Under this assumption, the expression in (2.8) is equivalent to 

^xfo{x)[0',EeW22,mW22,m'] = k^fo{x)[0',H'e], say. 

Next, let hii9) ■.= biEeWuA0)Wii,i{ey + b2EeW2i,ii9)W2i,i{ey. Un- 
der (2.9), 

fW2i.i{0y 

iie)=\ inie) 

V b2He, 

Moreover, the first term on the right-hand side of the expansion (2.4) now 
becomes 



l/2^fInmWll,^Mm) + Wl2,mMv^)) 

^ \ {b2He)-^W22MMVi) 



-n 

1=1 



Upon combining these facts with (2.7), we obtain that, uniformly in x G M, 

Knix, On) = Kn{x, 9) - V t^22,*(e)</^o(r/i) + Op(l) 

402 V'" i=l 



i=l 



/(??i <x)- Fo{x) — — ipo{r]i 



+ Op(l). 



Let Zn{x) denote the vector of the leading process on the right-hand side 
above, and 



p{x,y) :-- 



Fo{x Ay)- Fo{x)Fo{y) - ■^xyfo{x)fo{y) 

402 



x,yG 



Let Z he a vector of r independent mean zero Gaussian processes, with 
Cov(Z(a::), Z{y)) := p{x, y)Irxr- Using a conditioning argument and the above 
weak convergence result, one readily obtains that Cov(Z„(x), Z„(?/)) = 
^p{x,y)H0, and that 2Hg Kn{x,6n) =^ Z{x). This fact, together with 
the fact that ||2„ — iie\\ = Op(l), yields the following corollary. 

Corollary 2.1. Suppose the assumptions of Theorem 2.1 and Hq hold. 
If in addition (2.9) holds, then Kn sup^^^ ||Z(x)|| =: K. 

Observe that this limiting distribution depends only on the error d.f. 
Fq. Thus, its critical values can be approximated by simulation. When 
rji ~ A^(0, 1), the corresponding critical values are given in Table 1. This 
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table is constructed as follows. First, we approximate {Z{x) :x G [—4.0,4.0]} 
by {Z{xi) : i = 1, . . . , 2000} with Xi+i - Xi = 8/2000. The distribution of K 
is then approximated by sup]^<j<2ooo We use 10,000 independent 

replications to obtain the percentages of K. By increasing the ranges of x 
and i and the number of replications, the percentages of K have only ignor- 
able differences from those in Table 1. In the case fo{x) = e"'^' /2,x G M, the 
corresponding critical values are in Table 2. The simulation method is the 
same as that for Table 1 except now the range of x is taken to be [—8.0,8.0], 
since this density has a longer tail than the standard normal density. For 
other distributions, the critical values can be obtained through a similar 
method. 

We now indicate the behavior of the asymptotic power of this test under 
the local alternatives Hin : F„ = (1 - n~^/^5)Fo + n~'^/'^6F, where <6 <1 
and F is a d.f. Assume that {ijiii > 1} are independent of Yq under Hin- 
Let Pon and Pin be the joint distributions of (yi, . . . , ?/„) under Hq and Hin, 
respectively. In Section 5 we show that Pon and Pin are contiguous in the 
sense of Le Cam [19] . The following theorem gives the asymptotic power of 
the test statistic Kn against these local alternatives. 

Theorem 2.2. In addition to the conditions of Theorem 2.1, assume 
that F has zero mean and finite variance. Then, under Hin, Kn{x,9n) 



Table 1 

Upper percentage points of K with Fq = A'^(0, 1) 



\ ^ 

OL \ 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


0.01 


2.465 


3.150 


3.737 


4.361 


4.769 


5.173 


5.642 


6.065 


6.508 


6.938 


0.03 


1.890 


2.595 


3.100 


3.640 


4.094 


4.468 


4.946 


5.336 


5.677 


6.083 


0.05 


1.650 


2.289 


2.804 


3.267 


3.751 


4.118 


4.553 


4.922 


5.298 


5.679 


0.10 


1.317 


1.891 


2.382 


2.822 


3.262 


3.628 


4.033 


4.384 


4.716 


5.106 


0.15 


1.113 


1.666 


2.126 


2.552 


2.980 


3.325 


3.685 


4.022 


4.367 


4.720 


0.20 


0.988 


1.498 


1.931 


2.339 


2.750 


3.089 


3.436 


3.768 


4.101 


4.445 



Table 2 

Upper percentage points of K with Fq = double exponential distribution 



\ ^ 
a \ 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


0.01 


2.402 


3.149 


3.702 


4.298 


4.809 


5.173 


5.683 


5.937 


6.360 


6.845 


0.03 


1.876 


2.523 


3.073 


3.569 


4.015 


4.399 


4.872 


5.260 


5.611 


6.015 


0.05 


1.630 


2.250 


2.781 


3.218 


3.680 


4.067 


4.500 


4.865 


5.247 


5.607 


0.10 


1.299 


1.873 


2.344 


2.788 


3.222 


3.605 


3.969 


4.335 


4.680 


5.051 


0.15 


1.098 


1.640 


2.092 


2.533 


2.933 


3.279 


3.639 


3.991 


4.317 


4.691 


0.20 


0.961 


1.464 


1.902 


2.314 


2.703 


3.046 


3.399 


3.729 


4.065 


4.046 
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^Hq' Z[x), where Z{x) is an r-vector of Gaussian processes with mean vec- 
tor mg{x) := EgW22,i{(^){F{x) — Fq{x)), and covariance matrix equal to that 
of the process Z{x) in Corollary 2.1. 

This theorem shows that the test statistic Kn has nontrivial asymptotic 
power under the specified local alternatives, provided mg{x) / 0, for some 



3. Some applications. This section verifies the applicability of the results 
in the previous section to the GARCH and ARMA-GARCH models. These 
models are among the most important models in the field of time series, 
econometrics and finance. 

3.1. GARCH models. This subsection considers Bollorslev's GARCH {pi,P2) 
model defined by the equations 



where r]i are standardized i.i.d. r.v.'s. Clearly, it is an example of the model 
(1.1) with fj.i = = q, hi as given above, and s = S2 = {ao,ai, . . . ,apj,/9i, 
. . . ,f3p^y is the unknown parameter vector and its true value is 6 = 62 = 
(aoo , aoi , • ■ • , oopi , Poi,---, P0P2)' ■ The parameter space is 6 = {s : YljLi + 
Ejii/3j < Po, ^0 < "0 < ^0, and ^ < Qj, /?j < ^, i = 1, . . . ,pi,j = 1, . . . ,^2} 
for some constants po G (0, 1), < /3g < /3o and < /3 < /3 < 1. Note that 

is compact. Assume that 9 is an interior point of and Y^^=i'^j^'' and 

1 — X^jLi l^i^'' have no common zeros for each s G 0. Bougerol and Picard [6] 
have shown that, under these conditions, the model (3.1) is strictly station- 
ary and ergodic with Egy^ < 00. We now verify the remaining assumptions 
of the previous section. Let 



where B denotes the backward shift operator. Then from (3.1) one has 



(3.1) 







say. 



FITTING ERROR D.F. IN GARCH MODELS 



9 



dhi{s 



dao 
dhi{s 



dak 
dhi{s 



P2 



dan 



P2 



j=i / 



P2 



-1 



V j = l / 



1/2 



P2 



d(5k 



^ j=i J 



9Pl 



P2 



dPk 



\ j=i , 



dhi{s) 
dl3k ' 



Similar expressions can be obtained for d'^hi{s) / da\ and d^hi{s)/dakdPk' : 
1 < k < pi,l < k' < p2. From these expressions, it follows that the above 
{hi{s)} satisfy (2.1). From Lemma 2.1 in [22] and the compactness of G, it 
follows that, for some < p < 1, supj,g0 Poj{s) = 0{p') as j oo. 
Next, note that here Wu^i{s) = 0, Wi2,i{s) = and 



hi{s) 



dhi{s) dhi{s) dhi{s) dhi{s) 
daQ ' dai 



dhi{s) 



dap^ df3i 



da 



P2 



1 dhijs) 
hi{s) ds 



Thus, (2.9) is a priori satisfied. Further, assume that Er]f < oo. Francq and 
Zakoian [10] show that the quasi-MLE is -^/n-consistent. Using this quasi- 
MLE as an initial estimator and one-step iteration as in (3.1) in [23], we 
can obtain a new estimator 6n- Ling [23] showed that this satisfies the 
condition (2.4) under (2.2). 

Next, we verify (2.5)-(2.6). Because fii = 0, the first part of (2.5) is a 
priori satisfied. By Lemma A. 2 in [24], there are a neighborhood Ug of and 
a finite universal constant C and a p £ (0,1) such that, for any 7 > 0, and 
for all i, 



max[ sup 



1 d^h,{s) 



hi{s) dsds' 



. sup 

seUg 



1 dhi{s) 



hj(s) ds 



<C(1 + J2f^\y,.j\ 
\ j=i 



Thus, (2.6) holds. By Lemma A.l(ii) and A. 5 in [24], there exists a constant 
?E (0,1) such that sup,^UgMs)/hi{e)]<C{l + j:^^^i^\e,.j\)^-^^. Taking 
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7 < it follows that 



\\hi{sW C ^ 

sup , , < Eg sup 

s€Ug n'i[s)hi[0) ao seUe 



hi{e) 



. j=i 



< oo. 



Thus, (2.5) holds. This completes the verification of all the assumptions of 
Corollary 2.1. 

3.2. ARMA(1, 1)-GARCH(1, 1) models. Consider the ARMA(1, 1)- 
GARCH(1, 1) model defined by the equations 

Ui = ayi-i + bei-i + ei, 
(3-2) ^ 

f^i = 111^^1 and /ij = Qo + aej_i + 

where ryj is a sequence of i.i.d. standardized r.v.'s and symmetric around 
zero. We shall assume the usual condition: a,(3 £ (0,1), a,b£ (—1,1), and 
a + 6 7^ 0. This model is an example of the model (1.1) with q = 2, r = 
3, si = (a, 6)', S2 = (ao,a,/3)', s = (si,S2)', ei(si) = Ui - o-Vi-i - tej-i(si), 
/ij(si) = yi — ei{si), and hi{s) = ao + aef_i{si) + /3hj_i(s). We shall assume 
that, for some positive numbers ao,ao, ai,ai, < ao < oq, 0.1 < a < ai, 

< /? < «! and a + P < po with /?o S (0, 1). The parameter space is now 
the subset of whose members satisfy these conditions and is a priori 
compact. The true parameter values 9' = (^1,^2) taken to be in the 
interior of G, where 61 = (ao,6o)'i ^2 = («005 /^o)'- From [25], it follows 
that, under these conditions, the model (3.2) is strictly stationary, ergodic, 
and that EqUq < 00. 

To verify the remaining assumptions of the previous section for this model, 
proceed as follows. From (3.2), one obtains 

g»(gi) , ^ e»-i(^i) \^L7--i 

^ — = 2aej_i(si) — hp 5 = 2a > /^-^ ei-j(si) — ^ , 

^ = 2ae._i(.0^^ = 2aE/3^ ^-^(^1)^^' 



^/ti(g) 
9ao 



(1-/5)-', 
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dhd 



da 

dhjjs) _ 
dp 

Thus, here 



/ii_i(s) + /3 



da 
dhi^i{s) 



oo 



d/3 



W21AS) 



W22AS) 



1 



Vhiis) 



dei{si) dei{si) 



da 



db 



1 



dfii{s) 



hi{s) 



h,{s) 



dhi{s) dhi{s) 



da 



db 



Vhiis) dsi 
1 dh^is) 



hi{s) dsi 



dhi{s) dhi{s) dhi{s) 



dan 



da 



dp 



1 dh,{s) 
hi{s) ds2 



The symmetry of Fq around zero imphes (2.9) here. Further, assume that 
E-qf < 00. Ling [24] showed that the self- weighted quasi- MLE is -^/n-consistent. 
Using this estimator as an initial estimator and one-step iteration as in (3.1) 
in [23], we can obtain a new estimator 0^ . Ling [23] showed that this 0^ 
satisfies condition (2.4) under (2.2) for more general fractional FARIMA- 
GARCH models, which includes (3.2) as a special case. 

Next, we verify (2.5) and (2.6). Since Ee1{6i) < 00, there exist some 
< C < 00 and < p <1 such that 



(3.3) Eg sup 

s€Ug 



< Eg sup ||/il(s) 

s&Ue 



< 



Coo 
E 
k=l 



P^\£i~k\ 



< 00, 



(3.4) Eg sup 

s€Ug 



1 dhi 



\/hi{s) dsi 



<CEg 
<CEg 



sup E^ 



(i-i)/2 



seu. 



dsi 



1 2 



1 2 



k=l 



< 00. 



By Lemma A. 2 in [24], there are a neighborhood Ug of 9 and universal 
constants 0<C<oo,0</3<l, such that, for any 7 > 0, and uniformly in i 
and s, \\h-^{s)dhi{s)/ds2\\ < C{1 + ZT=i P^lyi-jl^ ■ Lemma A.l(ii) and 
A. 5 in [24], there exists a constant t € (0, 1) such that supg^^g[hi{s)/hi{6)] < 
^"^^ Thus, taking 7 < it follows that 



0(1 + ^7=11^^-3 

(3.5) 



E sup , , . 

s&Ue hi{e)hi[s) 



dhi{s) 



ds-2 



< 00. 



12 



H. L. KOUL AND S. LING 



From (3.3)-(3.5), we thus obtain that Egsups^ij^[\\hi{s)\\'^ /{hi{s)hi{e))] < 
aQQEgsupg^ljg\\hi{s)\\'^/hi{s) < oo, thereby verifying (2.5) in the present 
case. The condition (2.6) is verified similarly, thereby completing the verifi- 
cation of the assumptions of Corollary 2.1 in this example. 

3.3. Empirical results. This section first examines the performance of 
the test statistic Kn in finite samples through Monte Carlo experiments. 
The following AR(1)-GARCH(1, 1) model is used: 

(3.6) yi = ayi_i + £i, ei = r]i^/h'i, hi = + aef_i + f3hi_i, 

where the true parameters are {a,ao,a, 13) = (0.5,0.025,0.25,0.5) and r/j are 
i.i.d. In the experiments, we take the sample sizes n = 200 and 400. One 
thousand replications are used. The null distribution of rji is standard normal 
and its alternatives are given as follows: 

Ai: ^3/5*5, A2: ^Jl/2U, ^3: ^1/3*3, 

A4: double exponential, A^: [0.5iV(-3, 1) + 0.5iV(3, 1)]/Vl0. 

The experiments are carried out using Fortran 77 and the optimization al- 
gorithm from the Fortran subroutine DBCOAH in the IMSL library. From 
Table 3, we can see that the size of iir„-test is somewhat conservative when 
n = 200. However, when the sample size n increases to 400, the size is very 
close to the nominal significance level, for all selected levels. The power of 
this test depends on the shape of the alternative distribution. It increases as 
one moves from Ai to A^ . This is reasonable since the difference between the 
shapes of A^(0, 1) and ^3 is more than that of the shapes of A^(0, 1) and Ai, 
and similarly for A2 and A3. At the alternative A4, the power ranges between 
its values at the alternatives A2 and A^. The test reaches its highest power 
when the alternative is mixed-normal, that is, A^. Some simulation results 
not reported here show that the empirical size and power of Kn change a 
little when the true parameters {a^a^^a, (3) are changed in the stationary 
region. These simulations indicate that the proposed test has satisfactory 
size and power behavior in finite samples, and should be useful in practice. 

We next use model (3.6) to investigate the Hang Seng Index (HSI) in 
the Hong Kong stock market. Each period of two years from 1/6/1988- 
31/5/1996 is considered. The results are summarized in Table 4. In this 
table, the values in parentheses are the corresponding asymptotic standard 
deviations of the estimated parameters, LF is the value of log-likelihood 
function and QM is the portmanteau test statistic as in [20]. Both statistics, 
QM(6) and QM(12), suggest that this model fits the data adequately. All 
these estimators of the parameters (a, (5) satisfy the finite fourth moment 
condition: 3a^ -|- 2a/3 + /3^ < 1. We use the statistic Kn to test the hypothesis 
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Table 3 

Size and power of test statisitc Kn for rji m AR(1) -GARCH(1, 1) models: 1000 

replications 





n = 


200 








n = 400 




\ 0.1 

'/« \ 


0.05 


0.01 




0.1 


0.05 


0.01 










Size 








N{Q,l) 0.089 


0.041 


0.006 




0.102 


0.053 


0.008 










Power 








Ai 0.171 


0.086 


0.021 




0.348 


0.226 


0.058 


A2 0.309 


0.180 


0.056 




0.590 


0.453 


0.200 


^3 0.570 


0.434 


0.201 




0.909 


0.882 


0.581 


A4. 0.407 


0.247 


0.060 




0.793 


0.640 


0.283 


A5 1.00 


1.00 


1.00 




1.00 


1.00 


1.00 








Table 4 










Empirical resu 


'Its for Hong Seng Index fitted AR(1)-GARCH(1, 1) models 




Periods n 


a 


ao 


a 


/3 


LF 


QM(6) QM(12) 




1/6/88-31/5/90 493 


0.242 


0.083 


0.223 


0.772 


-117.9 


5.6 12.3 


6.44 




(0.055) 


(0.027) 


(0.048) 


(0.031) 








1/6/90-31/5/92 495 


0.186 


0.526 


0.203 


0.442 


-131.7 


2.9 7.6 


4.67 




(0.056) 


(0.162) 


(0.070) 


(0.145) 








1/6/92-31/5/94 498 


0.117 


0.322 


0.242 


0.664 


25.5 


6.3 14.5 


1.74 




(0.050) 


(0.108) 


(0.057) 


(0.068) 








1/6/94-31/5/96 497 


0.128 


0.053 


0.069 


0.900 


-95.6 


7.9 12.9 


2.16 




(0.048) 


(0.029) 


(0.025) 


(0.034) 









Ho:7]i ~ A^(0,1). Upon comparing the values of Kn in Table 4 with the 
critical values obtained in Table 4, one observes that the test Kn rejects Hq 
at significance level 0.01 during the first two periods, while it accepts Hq 
at significance level 0.10 during other two periods. Note that the Tian An 
Men Square event occurred during the summer of 1989. This event had a big 
impact on the Hong Kong stock market such that HSI has some nonnormal 
factors during this period. The event affected the Hong Kong stock market 
until 1992 and after that, its effect gradually disappeared. No other exciting 
event occurred and the HSI was rather quiet during 1992-1996. Our findings 
here seem to match this real circumstance very well. 

4. Some general results. This section contains some general results use- 
ful for obtaining various approximations in hereroscedastic nonlinear time 
series. The first result is an extension of Theorem 1.1 in [17]. To state these 
extensions, let {T]ni,'yni,'Tni,S,ni)A < i < n, he an array of 4-tuple r.v.'s de- 
fined on a probability space such that {r^ni^l < ^ < stre i.i.d. according 
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to a d.f. H, and rjni is independent of {'~fni,Tni,(,ni)i I <i <n. Furthermore, 
let be an array of sub cr-fields such that Ani C Ani+i, 1 < i < n,n > 

1; (7ni,<5ni,Cni) IS Ani measurable, ther.v.'s {r]ni, . . . ,r?„j_i; (7™, r„i, ^„i)> 1 < 
i < j} are ^„j-measurable, 2 < j < n; and r]nj is in dependent of Anj, 1 < 
j < n. Define, for 2; E M, 



Vn{x) := n ^ lnil{r]ni <X + XTni + Cni), 
1=1 
n 

Jn{x) := n~^^'^^-/niH{x + XTni + ^ni), 
1=1 

n n 

Vn{x) := n"^/^^7.„jl(r/„i <x + irii)-, Jn{x) :=n"^/^^7„ji?(x + ^„i), 

1=1 i=l 
n n 

V:{x) := n-^'^Y.^niI{Vni < x), J:ix) := n'^/^ ^7„i/?(x), 

1=1 1=1 

Un{x) := Vn{x) - Jn{x), Un{x) := Vn{x) - Jn{x), 

u:{x):=v:{x)-j:{x). 

Next, we introduce some assumptions: 

(4.1) H has a.e. positive density h with ||/i||oo := sup/i(x) < 00. 

(4.2) j \x\h{x)dx <oo. 

n 

(4.3) (a) n-'Y.^l = Opil), (b) max n-'/^\^ni\ = Op{l). 

1=1 

(4.4) (a) max = Op(l), (b) max |rni| = Op(l). 

l<i<n l<i<n 

A close inspection of the proof of Theorem 1.1 of [17] and the discussion 
on the bottom of page 544 there shows that if (4.1), (4.3), and (4.4)(a) hold, 
then 

(4.5) sup|C/„(x)-[/:(x)|=Op(l). 

This result has played a pivotal role in the development of a unified approach 
to the asymptotic distribution analysis of numerous inference procedures in 
homoscedastic time series models under minimal moment assumptions; see 
[15, 16]. We now state an analog of (4.5) suitable for heteroscedastic models. 
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Theorem 4.1. Under the above setup and under assumptions (4.1)- 
(4.4), 

(4.6) sup|C/„(x)-f/*(x)| =Op(l). 

The proof of (4.5) used a chaining argument with respect to the pseudo- 
metric df,{x, y) := sup|2|<^ 1-^^(3^ + ^) ~ H{y + z)!^/^, x, y e M, 6 > 0, appropri- 
ate for the location problem. If we let N{5, b) denote the cardinality of the 
minimal (5-net of (M, d^), then the crucial condition needed for this chaining 
argument is that, for some < &o < 1) /o {lnAA(u, 6o)}^^^ du < oo. On page 
544 of [17] it is argued that (4.1) implies this condition. 

The analog of the above metric that works in the current setup, that is, 
for the location-scale setup, is defined as follows. Let \z\ := \zi\ V \z2\, for any 
z := (zi,Z2)' G M?, and define 

Pb{x, y) = sup |F(x(l + zi) + Z2)- H{y{l + zi) + Z2)^/^ x, y G M, 6 > 0. 

\z\<b 

Let N{6,b) be the cardinality of the minimal 6-net of (M,/3(,) and let 

I{b):= / {In N{u,b)}^/'^du. 
Jo 

We shall now show that (4.1) and (4.2) imply 

(4.7) /(6)<c3o V0<6<1. 

Fix a < (5,5 < 1. Since H is continuous, there exists < Mis,M2S < oo 
such that 

H{-Mu) = 5^/2 and 1 - H{M2&) = 5"^ /2. 
When X, y G (-oo, -{b + Mu)/{1 - b)], 

Pb{x,y) < \H{x{l -b) + b) + H{y{l -b) + bf'^ < {2H{-Mu)Y'^ < 6. 
For x,y G [{b + Ad2s)/i^ — b),oo), using the monotonicity of H, 
Pbix, y) = sup \H{x{l + zi) + Z2) - H{y{l + zi) + za)!^^^ 

\z\<b 

< [|1 - H{x{l -b)-b)\ + \l- H{y{l -b)- b)\f' 

<{2{l-H{M2s))V^''<6. 

Partition the interval [-{b + Mis)/{l-b), {b + M25)/il-b)] as -{b + Mis)/{l- 
b) = xi < X2 < ■ ■ ■ < XNs = {b + M25)/(l - b) with Xk - Xk-i = <5V(2||/i||oo) 
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and Ns = 2(26 + Mis + M25)||/i||oo/[(l - b)5^], where [z] is the integer part 
of 2; G M. For each x,y £ {xk-i,Xk], using (4.1), we obtain 



1/2 

< [\\h\\oo\x - y|(l + < 6 (because b < 1). 



Pb{x,y) = sup|if(x(l + 21) + Z2) - H{y{l + zi) + Z2 

\z\<b 



With := E\rini\, by (4.2) and the Markov inequahty, Mu + M2S < 4/u/(^^, 
because 

1-2 

- = H{-Mis)<P{\Vnl\>Ml5)< ^ 



2 ....o,^^ vr,n.,^^.^.o,^^^^^, 

^ = l-H{M2s)<P{\Vnl\>M2s)< ^ 



2 M25 
Hence, using (5 < 1, we have 

2(26 + Mi5 + M25)||/i||oo 



N{5,h)<2 + Ns<2 + 



(l-6)(52 



^ 2{2h5^ + A^)\\h\\^ ^ 2{l-h)5^ + 2{2b5^ + A^l)\\h\l 



[I - h)5^ - (1 - h)5^ 

^ 2(l-6) + 2(26 + 4/x)||/t||oo _ Cb 
(1 - 6)54 54 • 

Note that Ch'-=2 + [(46 + 8^)||/i||oo/(l — 6)] is an increasing function of 6, 
with supo<f,<fe^ Ch = Ch^ < 00, for any 61 < 1. This proves (4.7). 

Thus, one can now repeat the arguments in [17] verbatim using the metric 
Ph instead of d^. No details are given here. However, we mention one typo 
that appears in the statement of Proposition 2.1 of [17] on page 552. The 
event [maxj|7„j| < 5/(1 + In A^((^, 6))"^/^] at (2.5) there has v}^"^ missing. It 
should be [maxi|7„i| <n^l'^5 / {I + \nN{5,b)Y/\ 

The above Theorem 4.1 is not enough to cover the cases where the 
weights and the disturbances Tni,Cni are functions of certain parame- 
ters and where one desires to obtain various approximations uniformly in 
these parameters, as is needed in the previous sections. The next result gives 
the needed extension of this theorem to cover these cases. Accordingly, let 
m > 1 be a fixed integer, lni,Vni,Uni be measurable functions from to 
R such that, for every t £ M™, {lni{t),Vni(t),Uni(t)) are independent of ijni, 
and ^„j-measurable, for each 1 <i<n. Let, for x G M, t G M™, 

n 

V(x, t) := n"^/^ lniit)I{l]ni <X + XVni{t) + Uni{t)), 
i=l 

n 

J{x, t) := l^i(t)H{x + xvniit) + u„i(t)). 
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(4.8) 

U{x,t) ■.= V{x,t)-J{x,t), 

n 

U*ix,t) := n-^/''Y.lni{t)[I{rim <x)- H{x)]. 

i=l 

To state the needed result, we need the fohowing assumptions. For each 

71 

(4.9) n~'Y.^lit) = Op{l), max n~'/^\lniit)\ = 0^(1), 

. f 1<1<71 

(4.10) max {\vniit)\ + \uni{t)\} = Op{l), 

l<i<n 

n 

(4.11) n-^'^Y.\^nim\Vni{t)\ + \Uni{t)\]=Op{l). 

1=1 

Ve > 0, 3 5 > 0, and an ni 9V0<6<oo,V||s|| <6, Vn>ni, 



(4.12) 



P[n ^^'^^\lni{s)\\ sup \Vni{t) - Vni{s) 
\ i=l Ht~s\\<5 



+ sup \uni{t)-Uni{s)\\ <e\>l-e, 

\\t-s\\<5 



(4.13) P[ sup n~^/^Y.\^m{t)-lm{s)\<e]>l-e. 

\\\t-s\\<5 J 

The fohowing lemma gives the needed result. 

Lemma 4.1. Under the above setup and under the assumptions (4.1) and 
(4.9)~(4.13), for every 0<b<oo, sup^giRj|t||<fe |Z^(x, t) -U*{x,t)\ = Op{l). 

Proof. The proof of this lemma is similar to that of Lemma 8.3.2 in [16] , 
with the proviso that one uses the above Theorem 4.1 instead of Theorem 
2.2.5 of [16] whenever needed. □ 

Proof of Theorem 2.1. Proof of Theorem 2.1 follows from Lemma 4.1 
applied as follows. Fix a 1 < j < r, and let W22j,i denote the j'th component 
of the vector W22,i- Now in the above setup take 

_ ^l^{e + n-yH)-^ii{e) 
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^'■''^ TW) ' 

Rewrite in terms of the above notation: 

I{'ni{0 + n-^/'^t)<x)=I{'ni<X + XVni{t)+Uni{t)), 

Kn{x,9 + n-^/^t) =U{x,t) 

n 

+ ^4i(t)[i^0(x + XVniit) + Uniit)) - Fo{x)], 

i=l 

where in U, the d.f. H is replaced by Fq. 

The rest of the proof consists of verifying the conditions (4.9)-(4.13) for 
the entities given at (4.14), the details of which are similar to those appearing 
in the proof of Lemma 8.3.1 in [16], hence omitted for the sake of brevity. 
□ 

Proof of Theorem 2.2. The result follows by use of Le Cam's third 
lemma (cf. [30], Theorem 3.10.7), as soon as we verify the contiguity of 
Pin to Pon- The sequence of probability measures {-Pin} here depends on n 
through the change in direction of the error distribution, while in [8], analo- 
gous alternative sequences depend on n through the change in a Euclidean 
parameter. Thus, their results of LAN are inapplicable. 

Now, let /c = / + c(/ - /) and C = (/-/) / V7) I c| < 1 and / be the density 
of F. Assume that < = JC'^{x)f{x) dx < oo. Let A„(c) : = EL i log(/c(r/i)/ 
/(ryi)) denote the log-LR of Pf„ to Pon- Verify that c'^ /[/^- - 
^c({x)^/J{x)]'^ dx — > 0, as c — > 0. From the proof of Theorem 2.1 in [26], 
under Pon it follows that 

n 
i=l 

By the central limit theorem, it now readily follows that, under P^n, n~^^'^ x 
Z]r=i Ci(^i(^)) ~^ -^(O)O'^)) which in turn implies that Pin is contiguous to 

Pon- □ 
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